DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
BASE
Show details
2
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ...
BASE
Show details
3
SIGTYP 2020 Shared Task: Prediction of Typological Features ...
BASE
Show details
4
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ...
BASE
Show details
5
Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness ...
BASE
Show details
6
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset ...
BASE
Show details
7
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
BASE
Show details
8
UniMorph 3.0: Universal Morphology
In: Proceedings of the 12th Language Resources and Evaluation Conference (2020)
BASE
Show details
9
UniMorph 3.0: Universal Morphology ...
BASE
Show details
10
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection ...
BASE
Show details
11
Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model ...
Mielke, Sabrina J.; Eisner, Jason. - : arXiv, 2018
BASE
Show details
12
Are All Languages Equally Hard to Language-Model? ...
BASE
Show details
13
Unsupervised Disambiguation of Syncretism in Inflected Lexicons ...
Abstract: Lexical ambiguity makes it difficult to compute various useful statistics of a corpus. A given word form might represent any of several morphological feature bundles. One can, however, use unsupervised learning (as in EM) to fit a model that probabilistically disambiguates word forms. We present such an approach, which employs a neural network to smoothly model a prior distribution over feature bundles (even rare ones). Although this basic model does not consider a token's context, that very property allows it to operate on a simple list of unigram type counts, partitioning each count among different analyses of that unigram. We discuss evaluation metrics for this novel task and report results on 5 languages. ... : Published at NAACL 2018 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/1806.03740
https://dx.doi.org/10.48550/arxiv.1806.03740
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
13
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern